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Abstract 

The subject of these lectures is a calculus of functions for deriving 
programs from their specifications. This calculus consists of a range of 
concepts and notations for defining functions over various data types 
(including lists, trees and arrays), together with their algebraic and 
other properties. Each lecture begins with a specific problem, and the 
theory necessary to solve it is then developed. In this way we hope to 
show that a functional approach to the problem of systematically cal- 
culating programs from their specifications can take its place alongside 
other methodologies. 
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1 Basic concepts 



1.0 Problem 

Consider the following simple identity: 

(a, X Oi x a 3 ) + (fl2 X a 3 ) + a 3 + 1 = ((1 X a x + 1) X aa + 1) X a 3 + 1 

This equation generalises in the obvious way to n values oi, a2j ■ ■ • 1 a n and 
we will refer to it subsequently as Horner's rule. As we shall see, Horner's 
rule turns out to be quite useful in our calculus. The reason is that the inter- 
pretation of X and + need not be confined to the usual multiplication and 
addition operations of arithmetic. The essential constraints are only that 
both operations are associative, X has identity element 1, and X distributes 
through -+. 

The problem we address in this lecture is to develop suitable notation 
for expressing Horner's rule concisely. 

1.1 Functions 

Except where otherwise stated all functions are assumed to be totaJ. The 
fact that a function / has source type a and target type j3 will be denoted 
in the usual way by / : a (i. We shall say that / takes arguments in a 
and returns results in 0. 

Functional application is written without brackets; thus / a means /(a). 
Functions are curried and application associates to the left, so } ab means 
(/ a) b and not f(a b). Where convenient we will write /„ 6 as an alternative 
to / ab. Functional application is more binding than any other operation, 
so / a © 6 means (/ a) © 6 and not /(a ffi b). 

Functional composition is denoted by a centralised dot ( •). We have 

(/ -ff)« = /(ff«) 

Various symbols will be used to denote general binary operators, in partic- 
ular, ©, ®, and ® will be used frequently. No particular properties of an 
operator should be inferred from its shape. For example, depending on the 
context, ffi may or may not denote a commutative operation. 

Binary operators can be sectioned. This means that (ffi), (a®) and (®a) 
all denote functions. The definitions arc: 

(ffi) ab = a® b 
(a©) 6 = a® b 
(®6) a — a ffi 6 
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Thus, if © has type © : a x 0 — ► 7, then we have 

(ffi) : a -»/?-» 7 
(a®) : /?-»7 
(©6) : a-»7 

for all a in a and b in /?. 

For example, one way of stating that functional composition is associa- 
tive is to write 

(/■)•(*■) = ((/•?)■) 
To improve readability we shall often write sections without their full com- 
plement of brackets. For example, the above equation can be written as 

(/•) •(»■) = (/ •?)• 

The identity element of © : a x a — > a, if it exists, will be denoted by id&. 
Thus, 

a © i<ij, = idj, © a = a 

However, the identity element of functional composition (over functions of 
type q a) will be denoted by t'4»- Thus 

idc, a = a 

for all a in a. 

The constant valued function K : 0 — > (i — > a is defined by the equation 

Kab = a 

for all a in a and b in 13. We sometimes write if a as an alternative to K a. 
1.2 Lists 

Lists are finite sequences of values of the same type. We use the notation [a] 
to describe the type of lists whose elements have type a. Lists will be denoted 
using square brackets; for example [1,2,1] is a list of three numbers, and 
[[1], [1,2], [1, 2,1]] is a list of three lists of numbers. The function [•] : a -» [a] 
maps elements of a into singleton lists. Thus 

[•]a = H 
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The primitive operation on lists is concatenation, denoted by the sign -ff . 
For example: 

[l]-H-[2] * [1] = [1,2,1] 

Concatenation is an associative operation, that is, 

x -H- (y -ft- z) = (x -H- y) -B- z 

for all lists x, y and z in [a]. 

In the majority of situations (though not all) it is convenient to assume 
the existence of a special list, denoted by [] and called the empty list, which 
acts as the identity element of concatenation. Thus, 

i -H- [] = [] -H-z = r 
for all x in [a]. To distinguish this possibility, we shall let [a] denote the 
type of lists over a including [], and [q]+ the type of lists over a excluding 
[]. Using algebraic terminology, ([a], -H-,[]) is a monoid, while -H-) is 

a semigroup. 

In order to specify functions over lists we need one more assumption, 
namely that ([a], -H-, []) is the free monoid generated by a under the assign- 
ment [■] : a — ► [a]. This algebraic statement is equivalent to the assertion 
that for each function f : a —* ft and associative operator ffi : (i x /3 — > /3, 
the three equations 

h[] = (da 

h[a) = fa 

h(x -H- y) = hx © ky 

specify a unique function h : [a] — ► p. In the case that id® is not defined, the 
last two equations by themselves determine a uuique fnnction h : [a] + — * {5. 

Any function h satisfying the first and third equations above is, by 
definition, a homomorph/sm from the monoid ([a], -If, []) to the monoid 
(0,(B,id$). The statement that ([a],-H-,[]) is free is equivalent to the state- 
ment that h is uniquely determined by its values on singletons. We shall 
discuss homomorphisms in greater detail in the next lecture. 

One simple example of a homomorphism is provided by the function 
# : [a] — t N which returns the length of a list. Here N denotes the natural 
numbers {0, 1, . . .}. We have 

#[] = 0 

Observe that + is associative with identity 0, so (N , +,0) is a monoid. 



3 



1.3 Bags and sets 

By definition, a (finite) bag is a list in which the order of the elements is 
ignored. Bags are constructed by adding the rule that -H- is commutative 
as well as associative. Also by definition, a (finite) set is a bag in which 
repetitious of elements are ignored. Sets are constructed by adding the rule 
that -ff is idempotent as well as commutative and associative. As we shall 
see, much of the theory developed below holds for bags and sets as well as 
lists. 

In the main we shall use the single operator -H- for all three structures, 
relying on context to resolve ambiguity. However, in order to distinguish 
different uses in one and the same expression, we shall sometimes use W 
(bag union) for the concatenation operator on bags, and U (set union) for 
the same operator ou sets. Singleton bags are denoted by \ a\ and singleton 
sets by {a}. 

A similar algebraic statement about freeness holds for bags and sets as 
well as lists. We assume that {la\, W, { j) is the free commutative monoid 
generated by a under the assignment {■}:<*-» \a]. Similarly, ({a}, U, { }) 
is the free commutative and idempotent monoid generated by a under the 
assignment {•}:£>—> {a}. In the case of bags this means that for each 
f : a —* (3 and associative and commutative operator © : (5 X j3 — * /3, the 
equations 

h\al = fa 

h(x Wjf) = hx ® hy 

define a unique function h : JaJ — » 0. Similar remarks apply to sets, except 
that we also require (B to be idempotent. 

For example, the size of a bag is the number of elements it contains, 
counting repetitions. It can be defined by the equations 

#11 =0 

#W = i 

#(arUy) = 

However, although + is associative and commutative, it is not idempotent, 
so the same equations do not define the size function on sets. 
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1.4 Map 



The operator * (pronounced 'map') takes a function on its left and a List on 
its right. Informally, we have 

/ *[fli,a2,-..,o n ] = 1/ a u f d2,...,f <!„]■ 

Formally, we specify /* by three equations 

/*[] = (I 

f*[a] = \fa] 

f * (x -H- y) = {f * x) -H- (/ * y) 

Thus, for / : a ~ > /3 the function /+ is a homomorphism from {[a], -ff , []) to 
+,[]). This function is a homomorphism on bags and sets as well as 

lists. 

An important property of + is that it distributes (backwards) through 
functional composition: 

(/■fl)* = (/*)-(fl*) 

This fact wjll be referred to as the (* distributivity) rule. Its use in cal- 
culations is so frequent that we shall sometimes employ it without explicit 
mention. 



1.5 Reduce 

The operator / (pronounced 'reduce') takes a binary operator on its left and 
a list on its right. Informally, we have 

®/[<Jl , <J2, • • • , On] = <J] © 02 © - ■ • © An 

Formally, we specify ©/, where © is associative, by three equations 

©/[] = t£^ (if id® exists) 

©/[a] = a 

®/{x-H-y) = (®/x)ffi(©/y) 

If © is commutative as well as associative, then © / can be applied to bags; 
and if © is also idempotent, then ©/ can be applied to sets. 

If © does not have an identity element, then we can regard ©/ as a func- 
tion of type ©/ : [or]+ — ► a. Alternatively, we can invent an extra value and 



5 



adjoin it to a. Provided a little care is taken, so-called 'fictitious' identity 
elements can always be added to a domain. For example, the minimum 
operator [ defined by 

ail = a if a ^ 6 
= 6 otherwise 

has no identity element in R (the domain of real numbers). However, the 
fictitious lumber oo can be adjoined to R. The only property attributed to 
oo is that 

aj.oo = oo|o = a 

for ail a in R U {oo}. In this way, inconsistency is avoided. 

Two useful functions where we choose not to invent identity elements 
are given by 

head = < / 
last = > / 

wbere the operators < ('left') and > ('right') are denned by 

o <K b = a 
a » b = b 

The function head selects tbe first element of a non-empty list, while last 
selects the last element. Both < and » are associative but neither pos- 
sesses an identity element. Both operators are idempotent, but neither is 
commutative, so head and last are not defined on bags and sets. 

1.6 Promotion 

The equations defining /* and ©/ can be expressed as identities between 
functions. They come in three groups: 



empty rules 



one-point rules 



f*-K\) = K[] 
9/-K[] = id* 



/*■ [■] = [■]■/ 
©/•[■] = id 
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join rules 



/* ■ */ = */ " (/♦)* 
©/ • -H-/ = ©/ ■ (©/)* 



The interesting rules here are the last two, since they are not simple tran- 
scriptions of the third equations for /* and ©/. A rough and ready justifi- 
cation of the join rule for /* is as follows: 



/* -H-/[«i, -.,««] 

= / * (xi -H- 2j -H- H- z„) 

= (/*xi)-H-(/*i2)-H-----H-(/*x„) 

= -H-/[/**l./*^,---,/*Xn] 



A similar justification can be given for the join rule for ©/. We shall refer 
to these two rules as map promotion and reduce promotion respectively. 
The nomenclature is historical and is intended to express the idea that an 
operation on a compound structure can be 'promoted' into its components. 

The map and promotion rules are often applied in sequence. Consider the 
following little calculation: 



These three steps will, in future, be compressed into one undei the name 
map and reduce promotion. 

1.7 Directed reductions 

We now introduce two more operators -ft (pronounced 'left- to-right reduce', 
or just 'left reduce') and / ('right -to-left reduce') which are closely related 
to /. Informally, we have 



© / • { '* • -H- / ■ g* — map promotion 



©/ • -H-/ - / * * ■ g* 



= reduce promotion 
©/ ■ ©/* 



= * distributivity 

©/ ■ (©/ • /* • g)* 



© j>e [111,02,.. .,<vl 
© *fe [ai,a 2 ,...,a J> ] 



((e © di) © 02) © ■ ■ • © a* 

d\ © (02 © ■ ■ - © (On © e)) 
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In particular, we have 

®/e[] = e 
®+e[] = e 

Thus, the parameter e (called the starting vaiue or seed) defines the value 
of the reduction on empty lists. 

Notice in a left-reduce that the brackets group from the left, and in a 
right-reduce they group from the right. Since an order of association is 
provided, the operator © of a left or right-reduction need not be associative. 
In fact, in a left-reduce we can have © : 0 x a — ^ P and in a right-reduce 
© : a x ji — > P. Thus, for a given seed e in p, the types of -f* e and j> e aje 

/ e : (/? x a ^ 0) -» [or] - 0 
ie : (axP ^ P)^[a]^ p 

Formally, we can define ffi^e on lists by two equations: 

©/e \x -H- [a]) = (©/e^)©a 

These equations are different in form to previous definitions, reflecting the 
fact that is not a homomorphism on lists. However, since every non- 
empty list can be expressed uniquely in the form n-H-[a], these two equations 
determine ©/e uniquely. (This point is dealt with in the third lecture). A 
similar definition holds for ®/ e . 

Both kinds of reduction can be applied to bags and sets provided © 
satisfies additional conditions, designed to ensure that the result of a directed 
reduction is independent of any particular representation of the bag or set. 
For ©/»e to be defined on bags we require that 

(b © <ii) © 02 ~ {b © 02) © «i 

for all b in p and 01,02 in Q- For sets we need the extra condition 

(b © a) © a — 6 © a 

Similar conditions are required for right-reductions. These remarks are given 
for completeness for we shall have little occasion to apply directed reductions 
to bags or sets. 

It is convenient for some purposes to define a more restricted form of directed 
reduction in which the seed is omitted. Informally, we have that 

0/ [fli,02,---,0»l = (( a i ® °2) ® °3) © • ' ' © °» 
© / [ai, 02, • • • , Onj = "1 ® (°2 © ■ • • © (<«n-l © 
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Note that the type of © in this kind of directed reduce mast be of the form 
© : q x a — » a. The value of © -j* [] is not defined unless © possesses a 
unique left-identity element, i.e. a value e satisfying 

e ffi a = a 

for all a. If such a value exists and is unique, we can set 

9 + [] = e 

Similarly, ffi / [] is defined only if © has a unique right identity element. 

There are a number of properties relating the various forms of directed 
reduction. For example, we have 

(©» ■ ([<»]+) = ©fa 

(©/)-(+[«]) = ®ta 

Other properties will be considered in a later lecture. For the present we 
give just one illustration of the use of left-reduce. 

Recall from the first section that the right-hand side of Horner's rule 
reads 

(((1 X d] + 1) X 02 + 1) X ■ ■ ■ + 1) X 0„ + 1 

This expression can be written using a left-reduce: 

® >1 [<*i,aj,...,fln] 
where the operator ® is defined by 

a®i = (oxl)+l 
It is interesting to compare this with another form of Horner's rnle: 
(ai X 02 X a 3 ) + (oj x a 3 ) + a 3 = ((a x x a 2 + a?) x o 3 + a 3 
Here, the general form of the right-hand side can be written as 

® [a u a2, . ..,0*], 
where, this time, © is defined by 

a®b = (axb) + b 
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1.8 Accumulations 

With each form of directed reduction over lists there corresponds a form of 
computation called an accumulation. These forms are expressed with the 
operators -jf* ('left-accumulate 1 ) and <ft- ('right-accumulate') and are defined 
informally by 

© #e[ai,<i2,...,a„] = [e,ee a l7 . ..,((e© a x ) © 03) © •• ■ © a„] 
© #e[ai,aa,---,a n ] = K ® ("2 © • ■ • © (o» © e)), . . . , © e, e] 

Formally, we have 

©#e[] = [e] 

®/e([°] + x) = [e] -H- (© #e © a *) 

Alternatively, we can define a loft- accumulation by 

©# c (x4fH) = (© #e x) -H- [6 © a] 

where 6 — iost(© x) 

Yet a third way of defining ®-§*e w iU be given in the next section. The 
definitions of the other accumulation operators are similiar and we omit 
details. 

Observe from the above equations that © -jf* e z can be evaluated with n 
calculations of ffi, where n = #2. For example, the list [0!, l!, . . . , n!] of the 
first (n + 1) factorial numbers can be computed by evaluating 

X [l,2,...,n] 

This requires 0(n) steps, whereas the alternative 

fact * [0, 1 , . . . , n] , 

where fact k = x/[l,2, . . . , k], requires 0(n 2 ) steps. Also amusing is the 
fact that 

Note that 

ffi/e - last ■ ®/ e 

so left-reductions can be defined in terms of left-accumulations. On the 
other hand, we also have 

©#e = ®/[ e ] 
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where 



x ® a — x -H- [last i © a] 



Hence left- accumulations can be defined in terms of left-reductions. 
1.9 Segments 

A list y is a segment of x if there exist tt and v such that 2 = u -H- y # v. If 
tt = [], then j/ is an initial segment, and if v = [], then y is a final segment. 

The function inits returns the list of initial segments of a list, inincreas 
ing order of length. The function toils returns the list of final segments of a 
list, in decreasing order of length. Thus, informally, we have 

inits[ai,02,...,a B ] = [[}, [a{\, [ ai , 02], . . . , K, 03, - - - , a n ]] 
tailslai,^,...,^] - [[o 1 ,fl2,...,a n ],[o2,03,...,a n ], ...,[]] 

The functions inits+ and tatfs+ are similar, except that the empty list does 
not appear in the result. 

These four functions can be defined by accumulations. For example, 



Alternatively, we can define these functions by explicit recursion equations. 
For example: 



The following result shows another way we can define accumulations. 
Accumulation lemma 



There are similar equations for right- accumulations. The accumulation 
lemma, is used frequently in the derivation of efficient algorithms for prob- 
lems about segments. On lists of length n, evaluation of the left-hand side 
requires O(n) computations involving ©, while the right-hand side requires 
0(n 2 ) computations. 




imts+ = (-H-#) ■ [•}* 



taib[] = {[}] 

taib(x -H- [a]) = (-H-[a]) * tails x -H- [[]] 



(ffi/e) = (®fi e ) *• inits 
(©#) = (ffi/)*-tnits+ 
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The functions segs returns a. list of all segments of a list, and segs + returns 
a list of all non-empty segments, A convenient definition is 

segs = 4f / ■ tails + ■ tnits 

For example, 

segs[l, 2, 3] = [[ ], [1], [], [1 , 2], [2], [), (1, 2, 3], [2, 3], [3], []] 

Notice that the empty list [] appears four times in deyd[l,2,3] (and not at 
all in segs+[l,2,3]). The order in which the elements of segs x appear is not 
important for our purposes and we shall make no use of it. In effect, we 
regard segsx as a bag of lists. This identification can be made explicit by 
introducing a 'bagifying' function 

bag = W/ • H+ 

which converts a list into a bag of its elements (and is the identity function 
on bags). We can then define 

bagsegs = bag • segs 

However, explicit use of bag can be avoided in many examples. Consider a 
function of the form 

P — ffi / ■ / * • bagsegs 

where we must have that $ is commutative as well as associative. We can 
calculate 

P = definition of bagsegs 

©/ • / + -til/ ■ I ■ \ * -segs 
= map and reduce promotion 

©/ • (©/ •/*■?•!)* ~ se s s 

= one-point rules 
©/ ■ / * ■ segs 

and so bagsegs can be replaced by segs, 
1.10 Horner's rule 

Now let us return to the problem posed at the beginning. Horner's rule can 
be expressed as an equation 

©/ ■ ® / + ■ tails - ®-fi e 
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where e — id® and a ® b — (a ® b) © e. This equation is valid provided ® 
distributes (backwards) over ffi, that is, 

(a © 6) ® e = (a®c)ffi(6®c) 

for all a, 6 and c of the appropriate type. This condition is equivalent to 
the assertion that the equation 

(®c) .«/ = ©/. (®c)+ 

holds an all non-empty lists. If we also assume that id® is a left-zero element 
of ®, i.e., 

id® ® c =r id® 

for all c, then the restriction to non-empty lists in the above assertion can 
be dropped. 

Horner's rule is proved by induction. The idea is to show that 
/ = ©/■»/* - tails 

satisfies the equations 

/[] = e 

f(x -(f [a]) = fx® a 

The way to do this is to use the recursive characterisation of tails given in 
the previous section. We shall leave details to the reader. 

Horner's rule can be generalised in a number of ways. We cite just two. 
First, we have 

®/ - ®/ * -tails + = 

where a © 6 = (a ® 6) © b. This formulation, which was hinted at in Section 
1.8, does not require that id® be defined. We also have 

©/ ■ (®/ ' /*)* - tails = ®-f>e 

where e = id® and a® b = (a®/ b)@e. This particular form of Horner's rule 
will be used in the next lecture. There are also forms of the rule involving 
right-reductions and inits. 
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1.11 Application 

Let us give one application of Horner's rule. There is a famous problem, 
called the maximum segment sum (mss) problem, which is to compute the 
maximum of the sums of all segments of a given sequence of numbers, posi- 
tive, negative or zero. In symbols 

mss = | / • + /* • segs 

Direct evaluation of the right-hand side of this equation requires 0(n 3 ) steps 
on a list of length n. There are 0(n 2 ) segments and each can be summed 
in O(n) steps, giving 0(n 3 ) steps in all. Using Horner's rule it is easy to 
calculate an 0{n) algorithm: 

mss = definition 

T/*+/** se 9 s 
= definition of segs 

T/ • +/* ■ -H-/ ' tails* -inits 
= map and reduce promotion 

17 " (T/ ' +/* ' tails)* -inits 
— Horner's rule with a®b = (a+b)]0 

17 ■©/{)*■ i™ts 
= accumulation lemma 

T/ • ©#o 

Horner's rule is applicable because + distributes through 1, and 0 = id + . 
The result is a linear time algorithm. 

An interesting variation of the problem is not so well-known. It is to compute 
the maximum segment product. In symbols 

msp = 1 1 • x /* • segs 

Since x does not distribute through 1 for negative numbers, the previous 
derivation does not work. However, we do have 

(aTfe)xc = (aXc)T(frXc) if c JsO 
(aTfc)xc = (axc)l(bxc) if c ^ 0 

where J takes the minimum of its two arguments. A similar pair of equations 
holds for (o | b) X c. These facts are enough to ensure that, with suitable 
cunning, Horner's rule can be made to work. The idea is to define ffi by 

bl) © (02, &2) = (fll i 02, 6l T h) 
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and ® by 

(a, b)®c - (axe, b X c) if c ^ 0 
= (b x c,a x c) othor T ,visn 

Then, using the observations about f and 1 given above, we have thai 

((oj, &0© (fl2,6a)) & c = ((a ]( 6i)® c) © ((03,62)® c) 

and so ® distributes backwards through ffi. 
Now define 

fx - {l/(x/*segax),V(x/ *segsx)) 
A similar calculation to before shows that 

/ = ©/ ■ ®#e 

where e = (1, 1), and 

(a,b)®c = ({a,b)<8 c)e(l,l) 

Hence we have 

msp = jt 2 ■ ©/ • ®#e 
where ^(a, fr) = 6. Again this is a linear time algorithm. 

1.12 Segment decomposition 

The sequence of calculation steps given in the derivation of the mss problem 
arises frequently. Here is the essential idea expressed as a general theorem. 

Theorem 1 (Segment decomposition) Suppose S and T are defined by 

S — ©/■/*■ segs 
T = ©/•/*■ tails 

If T can be expressed in the form T = h ■ ®-fre, then we have 

5 ~ ©/ - h * ■ ® # e 
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Proof 

We calculate 

S = given 

©/-/*■ segs 
= definition of segs 

©/ ' /* ■ -H-/ ' * -intts 
= map and reduce promotion 

©/ " (©/ ' /* " + -inits 
— hypothesis on T 

©/■(/»■ * 
= * distributivity 

©/ ■ h * ■ ® / e + -intts 
= accumulation lemma 

©/-/>*■ ® / e 

O 
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[6] Rem, M. Small programming exercises. Science of Computer Pro- 
gramming (various issues) 

2 Homomorphisms 

2.0 Problem 

Given is a sequence x and a predicate p. Required is an efficient algorithm 
for computing some longest segment of x, all of whose elements satisfy p. 

2.1 Homomorphisms 

By definition, a homomorphism from a monoid (a, SB, «^) to a monoid 
(/?,®,ta^) is a function h satisfying the two equations 

h id& - id® 

k(x (By) = h z ® hy 

Equivalently, using the map and reduction operators introduced in the first 
lecture, h is a homomorphism if 

k ■ ©/ = ®/ - h* 

We omit the proof (by induction) that these two definitions are equivalent. 

Since the map function /* is a homomorphism from monoid ([a], -if , []) 
to monoid ([/?], -H-, []) whenever / : a — » /?, we have, as an immediate conse- 
quence of the second characterisation of homomorphisms, that 

/* ■ -ft-/ = -H-/ ■ /** 

This is just the map promotion rule of the previous lecture. Likewise, the 
reduce promotion rule is an immediate consequence of the fact that ©/ is a 
homomorphism . 
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Our basic assumption, namely that ([a],-H-, []) is a free monoid, is equiv- 
alent to the condition that for each monoid there is a unique 
homomorphism h from ([q],+,[]) to (/?,©, This homomorphism is 
determined by the values of h on singletons. That is, for each / : a — ► /3, 
the additional equation 

h[a] = /o 

fixes h completely. 

The following lemma gives a useful characterisation of homomorphisms 
on lists. 

Lemma 2 Every homomorphism on lists can be expressed as the composi- 
tion of a reduction with a map, and every such combina tion is a homomor- 
phism. Afore precisely, suppose 

h[] = id* 

fc[a] = /« 

h(x -H- y) = hx®hy 

Then h - ©/ ■/*. Conversely, ifk has this form, then h is a homomorphism. 
Proof 

The proof of the homomorphism lemma uses the following simple result 
(called the identity lemma), whose proof will not be given. 

4f/ ■ [■]* = ufo 

We now calculate: 

h = definition of id 
h - id 

— identity lemma 

&■*/•[■]* 
= h is a homomorphism 
©/ ■ fc* ■ [•]* 

— * distributivity 

©/■ (fc •[-])* 

— definition of h on singletons 
©/ ■/* 
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Conversely, we reason that h = ©/•/* is a homomorphism by calcu- 
lating 

h ■ -Wl ~ given form for h 
©/■/*■ 4f / 
= map and reduce promotion 

©/ • (©/•/*)* 
= hypothesis 

□ 

Many functions on lists axe homomorphisms and we shall see examples 
below. However, not all of the functions we can define in terms of homo- 
morphisms will themselves be homomorphisms. We shall take up this point 
in the following lecture. 

2.2 Examples 

Let us consider some examples of homomorphisms on lists. 

(1) First of all, the function # is a homomorphism: 

# = +/ ■ *i* 

(2) Second, the function reverse which reverses the order of the elements in 
a list is a homomorphism: 

reverse — -|f / ■ [■]* 

where x-H-y = jr-H-x. (In general, we define ffi by the equation x®y = y(Bx.) 
Of course, on bags and sets, where -(f = -ff, the function reverse is just the 
identity function. 

(3) The function sort which reorders the elements of a list into ascending 
order is a homomorphism: 

sort = ffif - [•]* 
Here, fl\ (pronounced 'merge') is defined by the equations 
x * [] = x 

= y 

([a] 4f x) * ([4] y) = [a] (x * ([b] -4+ y)) if a $ b 
= [b>. I -H- {{[a] -H- *) * y) otherwise 
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Thus, x H\ y is the result of merging two sorted lists x and y. Since /ft is 
both associative and commutative, the function sort can be applied to bags. 
By defining a variant of fl\ that removes duplicates, so that the operation is 
also idempotent, we can sort sets. 

(4) Two useful homomorphisms are all and some: 

all p — a/ - p* 
some p — V/ ■ p* 

Here, A is logical conjunction and V is logical disjunction. The function allp 
applied to a list x returns True if every element of x satisfies the predicate 
p, and False otherwise. The function somep applied to x returns True if 
at least one element of x satisfies p, and false otherwise. Since conjunction 
and disjunction are associative, commutative and idempotent operations, 
all and some can be applied to bags and sets as well as lists. 

(5) The function split : [a] + —* [a] X a, which splits a non-empty list into 
its last element and the remainder, is a homomorphism: 

split[a] = ([],a) 

split(x -H- y) = split x ® split y 

where we define © by 

(x, a) © (y, b) = {x -H- [a] -H- J/, b) 

In particular, we can define 

init = 7Ti • split 

where Ti(u, v) = u. Unlike last (which is ttj ■ split), the function init is not 
a homomorphism. Note that the homomorphisms described in this example 
are homomorphisms on the semigroup ([a] + , -H-). 

Using init and last, we can define the function tails of the last lecture 
as a homomorphism 

taiis = ©/•/* 

where 

fa = [[],[a]] 

xs®ys = init xs -ff (last xs-ft) + ys 

A simple calculation shows that id® = [[]], so we have tails[] = [[]], as 
expected. 
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2.3 All applied to 



In order to be able to describe homomorphisms, such as tails, a little more 
concisely, it is useful to introduce an operator 0 (pronounced 'all applied to') 
denned by 

IT" = [] 

(fs^gs)°a = (fs° a) 4f (gs° a) 
Less formally, we have 

\f,g,...,h]"a = \fa,ga, ...,ka] 

Thus 0 takes a sequence of functions and a value and returns the result of 
applying each function to the value. Note that (° a) is a homomorphism. 
Note also that the notation [■] we have been using so far can be rewritten 
as [id]". 

We can now write, for example, 

tails = ©/ • [[] 0 ,[i<*]°]% 

2.4 Conditional expressions 

So far, we have been using the notation 

kx = fx if px 

- gx otherwise 

to describe functions defined by cases. From now on, we shall also use the 
McCarthy conditional form 

h = (p-> f,9) 

to describe the same function. Like the operator of the previous section, 
conditional forms can, in some situations, help to make the expression of 
homomorphisms and other functions more concise. 

There are a number of well-known laws about conditional forms, the 
most important of which are: 

h ■ (p -> f,9) = (P ~* h ■ f,h ■ g) 
(P~* f,9) ■ h = (P ■ h-+ f ■ h,g • h) 
(p-fJ) = f 

(Remember, all functions are assumed to be total, so these laws need no 
qualifications about definedness.) 
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2.5 Filter 



The operator < (pronounced 'filter') takes a predicate p and a list x and 
returns the sublist of x consisting, in order, of all those elements of x that 
satisfy p. Using the new notations just introduced, we can define p< as a 
homomorphism 

j>« = -H-/-0> -»[«*]",[]> 

In effect, p<x is obtained by replacing each element a of z by [a] if p a holds, 
and [] otherwise, and then concatenating the resulting lists. Note that p< 
can be applied to bags and sets as well as lists. 

An easy calculation shows that the following rule (which we will call 
filter promotion) holds: 

(P<) • -H-/ = -H-/ ■ (H* 

Another rule, whose proof is also left to the reader, is the map-filter swap 
rule: 

P< ■}*=}*■ {P ■ f)< 

2.6 Cross-product 

If © is a binary operator, then X e is a binary operator that takes two lists 
i and y and returns a list of values of the form o © 6 for all a in x and 6 in 
y. For example: 

[o, b] X® [c, d, e] = [a © c, 6 © c, a © d, b © d, a © e, b © e] 

Formally, we define X e by three equations: 

***[] = [] 

x x ® [ a ] = (®a)*z 

xX e (y-H-z) = (x X e y) 4f (x X e z) 

Thus (zX$) is a homomorphism (on lists, bags or sets) for every x. 

There axe a number of useful properties of X$. We shall state them 
without proof. 

First of all, X s is associative if © is, and commutative if ffi is. It is not, 
in general, idempotent if © is. 

Next, [] is the zero element of X$, that is, 

[] X e x = x X® [] = (] 



22 



for all x. 

We also have the cross promotion rules: 

f = xw-/*** 

©/* ■ X*/ = X®/-©/* 
Finally, we have that if ® distributes through ffi, then 

©/ ■ x®/ = ®/ • e/* 

This result says that the sum of the products is the product of the sums. 
We shall call it the cross-dsstcibutivity rule. 

The particular operator X + has many uses. For example, the cartesian 
product function cp : [[a]] — + [[a]], denned by 

cp = XW ■ [id]"* 

takes a list of lists and returns a list of lists of elements, one from each 
component. For example, 

cp[[a, b},[c],[d,e]] = [\a,c, d], [b, c, d], [a, c, e], [i>, c, e]] 

Second, the list subs x of all subsequences of x can be defined as the homo- 
morphism 

^=x+/-ir.i«!T* 

For example 

S ubs[a,b,c] = U/[[[inM[%[[Uc}}} 
and the expression on the right simplifies to 

[[], [a],[b},[ a , b],[c],[ a ,c],[b,c},[a,b,c]\ 

Third, we have 

(allp -> [id]°,\T) = X*/ • (p - [NT,[]> 
This technical result means that we can write all p< in the form 

o i(p< = 4f /.(x + /-(P-[MT ; []°)0* 

This daunting expression will make another appearance in the next section 
but one. 
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2.7 Selection operators 

Suppose / is a numeric valued function. We want to define an operator 
such that 

1. \j is associative, commutative and idempotent; 

2. jy is selective in that 

x Ty y - x or x l f y = y 

3. |y is maximising in that 

/(i T,y)=/*T/s 

If / is an injective function, then the above three conditions specify f/ 
completely (actually, idempotence follows from selectivity). If, however, / 
is not injective, then the value of x 1j y is not specified when x ^ y but 
fx = f y. For example, the value of 

[1.2] T # [3,4] 

is not determined hy the above conditions, beyond the fact that it must be 
one of [1,2] or [3,4], 

There are two ways to resolve such under- specifications. One is to forgo 
commutativity, defining for instance a left-biased version of f/ : 

x\ t y = x iifx^fy 
= y otherwise 

This solution is not very satisfactory because the calculation of expressions 
such as 

T#/ • P <■ -segs 

depends artificially on the precise order in which the function segs returns 
the list of segments of x (a feature which we said in the last lecture we would 
ignore). 

The alternative is to let |y stand for T/<> where /' is an injective func- 
tion, the precise nature of which we are not interested in, that respects the 
ordering on values given by /, that is, 

f x < f y implies f x < }' y 
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If necessary to ease a calculation, we can always introduce refinements of / 
(i.e. a function that respects the ordering of / but may introduce further 
distinctions), provided such refinements are consistent with all previous ones. 

One particular refinement of is especially useful and we impose it at 
the outset. We shall assume that -H- distributes through f^, in other words: 

x-H-(»t#-0 = (x -H- y) T# (x -H- z) 
(iT#y)-H-^ = (i -H- z) T# (y -H- z) 

Such a refinement arises if, for example, we always select the lexicographi- 
cally least sequence as the value of x f# y when #z = #y. 

Since we mainly do calculations at the function level, we would like to 
write the above distributive rules in the form 

(s-H-) • T#/ - T#/ ■ 

C-H-x) ■ T#/ = 

The missing piece which enables the two forms to be connected (without 
restricting ourselves to non-empty lists) concerns the fictitious value w = 
!#/[]■ This is not the empty list, but a very short list satisfying #u> — 
— oo. In other words, we want to suppose that w is the zero element of 
-H-. This decision can be couched in algebraic language: we suppose that 
([o],4f , T# , [ ] , w) is a semiring. In general, a semiring (S, X,+, id x ,id + ) 
is a set 5 closed under two associative operations X and +, with + also 
commutative, such that X distributes over +. Moreover, the identity element 
of + is the zero element of x. 

Let us put these assumptions to work in a short calculation: 
| # / ■ altpo 

— daunting expression for o//po from Section 2.7 

u/ - */ ■ ow ■ (p - [Mmn»* 

= reduce promotion 

t,/;(W-W;(?-[Mt[l> 

= semiring assumption and cross-distributivity 

T # /-(*/-T#/*-(p-[HT,[]»* 

= + distributivjty 

T # /- W-(T # /-Cp-[[WH°,[]»* 

= conditionals 

T#/-(-H-/-(p-T # /-[[«imT # /-[]»* 

= empty and one-point rules 
T#/ ■ (-H-/ • (p- [id]°,K w )*)* 
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We shall use this result in the next section. 
2.8 Solution 

The problem wo started the lecture with was to compute the longest segment 
of a list, all of whose elements satisfied some given property p. In symbols, 
we want to compute /, where 

/ = T#/ ' allp< -segs 

Let us calculate: 

T # / • ailp-a - segs 
= segment decomposition 

T#/ ■ (T#/ ■ allp< - tails)* • inits 
— result at end of last section 

T#/ • (T#/ • (-H7 • (P -* [id] 0 ,^)*)* ■ tails)* - inits 
= Horner's rule with z©a = (i4f(pa-» [a],w)) f # [] 

T#/ ■ ® /j] * • *™ ts 

accumulation lemma 

T#/ • ®#[] 

Finally, we can simplify x ® u to 

i©fl = (po-n# [«],[]) 

This is a, linear time algorithm (in the number of calculations of p). 

The derivation of the above program might seem a little elaborate, bring- 
ing in cross-products, semirings, fictitious elements and so on, just to crack 
a small walnut. The central aspect, namely that 

| # / • allp< ■ tails 

can be expressed as a left-reduction , can be established quite quickly by an 
induction proof, one that avoids all talk of zero elements of concatenation. 
However, we have succeeded in avoiding induction, used only equational 
reasoning, brought in a second application of a useful rule, and introduced 
some more concepts and notations. 
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2.9 References 

Farther discussion of some of the operators introduced above is in: 

[1] Bird, R.S. A calculus of functions for program derivation. Proc. In- 
stitute of Dec/arative Programming, University of Texas, USA, 1987. 
(Also available as a Programming Research Group Monograph PRG- 
64, Oxford, UK.) 

An extensive discussion of homomorphisms on trees, lists, bags and sets 
is in: 

[2] Backhouse, R. An exploration of the Bird-Meertens formalism. (Un- 
published draft), Dept. of Computer Science, Groningen University, 
The Netherlands. (1988) 

3 Left reductions 

3.0 Problem 

Given is a list of lists of numbers. Required is an efficient algorithm for com- 
puting the minimum of the maximum numbers in each list. More succinctly, 
we want to compute 

minimax = j/ • T/* 

as efficiently as possible. 

3.1 General equations 

So far, we have mainly seen examples of homomorphisms. It is instructive 
to determine the conditions under which a general set of equations 

h[] = c 

h[a] = fa 

A(s-H-jr) = B(x,yjxjy) 

determines a unique function h, not necessarily a homomorphism. After all, 
such sets of equations constitute the basic means for specifying functions on 
lists. 

Consider the equations 

h'[] = (M,e) 

h'[a] = ([«,],/<») 

h'{x -H- y) = h'xeh'y 
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where ffi is defined by 

{x, u) e (j/, v) = (z -H- y, E(x, y, u, v)) 
If h' is a well-defined function, then so is h. We have 

k = ir 2 ■ h' 

wheie jr 2 (a, b) — b. 

In order to determine the conditions under which the above equations 
determine h' , let /? be the smallest set of values such that 

1. ([],e)ism/?; 

2. ([a},f a) is in /3 for all a in a; 

3. (x, 11) © (y, v) is in /3 whenever {x, «) and (y, «) are. 

Now, by our basic assumption, h' is uniquely determined if (/?, ©, ([], e)) is 
a monoid. Translating the monoid conditions (associativity, and existence 
of an identity) into conditions on e and E , we must therefore have: 

1. H(x,[],u,e) = u 

2. H([],x,e,v) = « 

3. H(x 4f y,z,H{x,y,v,v),w) = y -+f u, z, t;, id)) 

These three conditions (the consistency conditions) determine the properties 
that E and e must satisfy in order for the equations for h to determine h 
completely. 

Let us consider one example. Take e = [] and 

E{x,y,M, v) - (xt - x -> v H- v,v) 

Here we use the McCarthy conditional form on the right. We leave the 
verification of the first two conditions on e and E to the reader. For the 
third condition, the expression E [x 4f j/, z, E (x, y, u, u),y) reduces to 

(u — xAv = y — >u-H-"-H-i"i 
u = xAv^y->u-H-t;, 
»^iAti = i-H-y- >u-H-u>, 
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On the otter hand, the expression H(x, y -ff z, v,H(y,z, v, w)) reduces to 

(tt = xhy = y->v-ti-v-H-ur, 
u—xAv^y— *ti-H-if, 
u ^ x — * u) 

These two expressions are not equal unless we have that 

u = iV u ^ x -ft- y 

for all (x,ti) in /3. This condition is equivalent to #x ^ #ti for all (r, u) in. 
P and is satisfied if 

1 

for all a. In particular, if we take 

/ = (P -[•],[]") 

for an arbitrary p, then everything is all right. With this definition of /, 
the value of h x is just the longest initial segment of x all of whose elements 
satisfy p. In symbols: 

h = j^/ • allp< ■ inits 

As a last point, we show that h is not a homomorphism. For concreteness 
take p = even, the predicate that determines whether a number is even. 
Suppose 

h{x -H- y) = hx e ky 
for some operator ffi. Since h[2, 1] = 2,&[4] = [4], and h[2] = [2], we have 

fc[2,l,4] = h[2,\}®k[A] 

= [2]e[4] 

= h[2]®k[4] 
= M2,4] 

This is a contradiction, since h[2, 1,4] = [2] and k[2,i] - [2,4], 
3.2 Left reductions 

We defined the directed reductions in the first lecture, but the pattern of the 
equations does not follow those laid down in the previous section. We really 
need to give an alternative definition in order to justify that, for example, 
©/ e is a well-defined function. 
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In the monoid view of lists, the formal definition of ©/»e is 

©/el] = e 

© y> e [a] = e © a 

© y>e (x -B- y) = © ;/y !/ where e' - © y- e x 

Equivalently, setting / e = ©/e, we have 

/e[] = e 

/ e [a] = e © a 

/e(x-H-jf) = f(fex)y 

We leave to the reader to check that the above equations satisfy the consis- 
tency conditions of the previous section. 

There is an instructive alternative way of seeing that ©/e is well-defined. 
Define h by 

h[] = id 

h[a] = (©a) 

h(x -fry) = hy ■ hx 

For © : j} x a — • /J we have that h is a homomorpliism 
A:{[«],-H-,|])-(^-^-.^) 

Now we have 

© /e 2 —hie 

and so ©y> e is a well-defined function. 

The above reasoning justifies the well-definedness of -f* e but does not 
explain why the construct is important. The basic reason why left reductions 
are important (and similar remarks apply to right reductions) is as follows. 

Consider a set of equations of the form 

/[] = e 

/(z-H-M) = F( a ,*J*) 

We claim that 

/ = tt 2 • ©y> e , 

where 

(i,u)©a = (z -H- [o],F(a,x,w)) 
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In brief, every set of equations of the above form can be expressed in terms 
of a left reduction. Conversely, we have 

®/ e [] = e 

e / e (x -H- [a]) - (© / e x) © a 

so an arbitrary left reduction can be expressed in the same way. 

The above discussion can be summarised in terms of the different ways 
we can view lists. The monoid view of lists is to say that every list is either 

(i) the empty list; (ii) a singleton list; or (iii) the concatenation of two (non- 
empty) lists. The primary mechanism for denning functions with this view 
is the homomorphism. Another view of lists is that every list is either (i) 
the empty list; or (ii) of the form x -H- [a] for some list x and value a. The 
primary mechanism in this case is the left reduction. 

Yet a third view of lists is that every list is either (i) the empty list; or 

(ii) of the form [a] -H- x for some a and list x. The primary mechanism here is 
the right reduction. In the majority of functional programming languages, 
it is this third view that prevails. One of the reasons concerns the possibility 
of defining functions on infinite lists, a reason that we will not go into here. 
Fortnnately, we can define left reductions with this view as well. We have 

©7*e[] = e 

© -he ([ a ] -H- x) — © / e / x where e' = e © a 
We leave the verification of this fact to the reader. 

3.3 Loops 

In the functional approach to program derivation, the final product of a 
calculation is an expression denoting a mathematical function. This expres- 
sion still has to be translated into a specific programming language in order 
for it to be executable by computer. One obvious candidate is a, functional 
programming language, such as ML or Miranda 1 . However, there is no rea- 
son why the final expression should not be translated into a conventional 
imperative language. For example, a left reduction can easily be translated 
into a loop. Using hopefully straightforward notation, the value © j4 e x is 
the result delivered by the following imperative program: 

I [ var a; a := e; 
1 Miranda is a trademark of Research Software Ltd. 
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for b in x 

do a := a oplus b; 

return a ] I 

Here, the 'generator' b in J successively assigns to b the elements 
of x in order from left to right. 

3.4 Left-zeros 

Both the imperative and functional implementations of left reductions re- 
quire that the argument list be traversed in its entirety. Such a traversal 
can be cut short if we recognize the possibility that an operator may have 
ieft-zeros. By definition, u> is a left-zero of © if 

u © a = u> 

for all n. An operator may have none, one or many different left-zeros. If w 
is a left-zero of ffi, then 

© j\j x = w 

for all x. Since 

© / e (z -H- y) = © y where e' = © j> e x 
it follows that 

© (s -H- y) = © j>e x 

whenever the right-hand side is a left-zero of ©. In words, evaluation of a 
left-reduction can be terminated on encountering a left-zero. 
Suppose lzero 9 is a predicate that determines whether its argument is a 
left-zero of ffi. Using hopefully equally straightforward notation as before, 
we then have that }/ e i can be evaluated by the program 

I [ var a; a := e; 

for b in x while not lzero(a) 
do a : = a oplus b ; 
return a ] I 

Before seeing an application of this idea we need a simple yet powerful result. 
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Lemma 3 (Specialisation) Every homomoiphism on lists can be expressed 
as a left (or also a fight ) reduction. More precisely, 

©/•/* = 0/e 

where e = id# and 

a 0 6 = affl/6 

We omit the simple proof. 
3.5 Minimax 

Let us return to the problem of computing 

minimax = 1/ • f/* 
efficiently. Using the specialisation lemma, we can write 

minimax ~ O^oc 
where oo is the identity element of J,/, and 

a 0 x = a i (T/z) 
Since J, distributes through | we have 

o0i= T/(a|) * x 
Using the specialisation lemma a second time, we have 

a O x = ©a x 
where — oo is the identity element of f and 

b ® a c = 6 T (i I c) 

Now, a is a left-zero of ffl a (and so, by the way, is oo), and -co is a left-zero 
of ©. This means we can implement (minimaz xs), where xs is a list of lists, 
by the loop 

|[ var a; a: = infinity; 

for i in is wbile a O -infinity 
do a := a odot x; 
return a ] I 
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where the assignment a : = a odot x can be implemented by the loop 

I [ vax b; b; = -infinity; 
for c in i while b <> a 
do b := b max (a min c); 
a := b ] I 

3.6 The alpha-beta algorithm 

We now generalise the minimax problem to trees. Consider the data-type 

tree::— Tip num\ Fork [tree] 

The syntax of this declaration is that of the functional language MiTanda, 
and it is also employed in the notation of Bird and Wadler (reference [2] 
of Lecture 1). It says that (Tipn) is a tree for each number n, and that 
(Fork ts) is a tree whenever ts is a sequence of trees. The primitive functions 
Tip and Fork are called the constructors of the type tree. 

We wish to calculate an efficient algorithm for computing a function 
eval : tree — » iwim, where 

eval( Tip n) = n 
eval(Forkts) = y(-eval) * ts 

Here we use the notation — / for the function defined by (-f)a = — (/ a). 

Using the specialisation lemma on the right-hand side of the second 
equation for eval, we obtain 

eval(Forkts) — © 7 4_ 00 ts 

where 

a © £ = a 1 (-eval t) 

We now expand this last equation by considering the two possible forms for 
a tree t: 

a © (Tipn) = a](-n) 

a 9 (Fork ts) = a ] (-(1 /(-eval) * ts)) 

The last equation can now be simplified using the laws 

-(«T&) - (-a)i(-b) 
a](blc) = (a T &) Kate) 
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We obtain 

a © (Fork ts) — !/(af) * * ts 

After usiDg the + distributivity law, the right-hand side of this equation is 
also a candidate for specialisation. We have 

a © (Fork ts) = ® fl -f>^ ts 

where 

6 ®a i = b | (a | eW t) 

Furthermore, since 

ewai J = oo I (— oo f evali) 
- oo t 

we have — without inventiveness — reduced the problem of calculating eval t 
to that of evaluating 6 ®a t for values of a and 6. 

Let u6 now expand the definition of 6 ® a t in a similar way as we did for 
a®t. We obtain 

b® a (Tipn) = 6 J, (a jit) 

b® a (Forkts) = b I (a T (|/(-eud) + £s)) 

In oder to simplify the right-hand side of this last equation, we need the 
dual distributive law 

H(«Tc) = (H*)T(Uc) 

and the fact that evaluation of 6 ® a i is required only for values of a and 6 
satisfying a — a { b; in other words, for a ^ b. In such a case we have 

b|(aTc)=at(Alc) 

by commutativity of j. 
We then obtain 

6®a (Fori is) = a T (t/(H) * ^) 

Using specialisation yet a third time, we obtain 

b ®a (Fori ts) = ©j, ts 



35 



where 

a Q^t = al (b I (-evalt)) 

At this point, the seemingly endless succession of expansion and specialisa- 
tion steps can be stopped. A short calculation using the given properties of 
— , T and I yields 

-(-<*) * 

Introducing 

bval a 0 t = 0 ® Q t 
and putting the resulting equations together, we obtain 

evalt = bval (— 00)00 t 

bval a 0 (Tip n) = 0 I (a T n) 

bval a 0 (Fork ts) = ®p-j* a ts 

a'e^t = -bval(~0)(-a')t 

Finally, we bring on the left-zeros. We only need to observe that 0 is a 
left-zero of This follows from the definition of ©^ and the absorbtive 
law 

0T(/U7) = 0 

Incorporating this optimisation yields the alpha-beta algorithm. 

The various axioms concerning ( j, J., — , 00, —00) used in the above derivation 
are precisely those of a Boolean algebra. 

3.7 References 

An alternative, and arguably less satisfactory, treatment of the alpha-beta 
algorithm occurs in 

[1] Bird, R.S. and Hughes, R.J.M. The alpha-beta algorithm: an exercise 
ia program transformation. Information Processing Letters, 24 (1987) 
53-57. 

The importance of the purely algebraic notion of left-zeros and its conse- 
quences for optimisation was discussed in 

[2] Meertens, L. First steps towards the theory of Rose trees. (Unpub- 
lished draft), CWI, Amsterdam, 1988. 

This paper also contains another treatment of the alpha-beta algorithm. 



36 



4 Arrays 

4.0 Problem 

Given is an array x with elements in the set {0, 1}. Required is an efficient 
algorithm for computing the area of the largest rectangle (i.e., contiguous 
subarray) of i, all of whose elements are 1. 

4.1 Binoids 

Suppose a is a set of values dosed under two partial operations + and x 
such that: 

(i) + and x are associative, in the sense that each of the equations 

(o+b)+c = a+(b+c) 
(ax b) x c — a x (b X c) 

hold whenever both sides of the equation are defined; 

(ii) + and x satisfies the further equation 

(a + b) x (c + d) - (a x c) + (b x d) 

whenever both sides are defined. (We shall refer to this property by 
saying that + abides with X. The reason for this choice of name will 
appear shortly.) 

There is no standard terminology for such an algebraic structure so, for the 
sake of a name, we shall call it a f>ino/d. Here are foul examples of binoids, 

ExampJe 1. Let $:«xcnabe associative and commutative. Then © 
abides with itself, and so («,©, ©) is a binoid. 

ExampJe 2. Recall that the operator •< is defined by a < b = a. Note that 
•< is associative but not commutative. Nevertheless, the structure (a, <C, <£) 
is a binoid. Since both sides of 

{a < 6) < (c < d) = (a < c) < (b < d) 

reduce to a, we have that < abides with itself. Similarly, («,>,>>) is a 
binoid, where a >■ 6 == b. 
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Example 3. The structure (o, >, <c) is a binoid. We have that both sides 
of 

(a > fc) < (c >tf) = (a <£ c) » (6 < d) 

reduce to i. 

Example 4. Define the partial operator • by the equation 

o • b = a provided a — b 

The operator • is associative because 

(a • b) • c = a • (6 • c) 

whenever both sides are defined, that is, when all three values are the same. 

Let ffi : q x a — > o be some associative operator. We claim that (o, ffi, •) 
is a binoid. We have 

(a © b) • (c ffi d) = (a • c) © (b ■ d) 

whenever both sides are denned. The right-hand side is denned just in the 
case that a = c and 6 = d. Its value is then a © b. The left-hand side 
is denned just in the case that a © 6 = c ffi d and its value is then a © 6. 
Notice that the left-hand side can be defined without the right-hand side 
being defined. 

4.2 Arrays 

The type of arrays with elements from a will be denoted by \a\. This 
type is specified by a free algebra generated from elements of a under the 
assignment | j : o — » |a| which maps elements of or to singleton arrays. The 
constituents of this algebra are: 

(1) Two operators e (pronounced 'above 1 ) and <)> (pronounced 'beside') 
such that (|a[, ■©,<(>) forms a binoid. The abide property 

(x e y) $ (u e v) = (x <t> u) e (c <)> d) 

can be pictured as 

fx lu \ = fx\u\ 

\y \v ) \y\v) 

(The name abide is an abbreviation of 'above- beside'.) 
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(2) Two functions height and width with types 

height, width : \a\ -> N + 

where N + denotes the positive integers. Informally, height returns 
the number of rows in the array, and width the number of columns. 
These functions are are such that x $ y is defined just in the case 
that height x = height y, and x o y is defined just in the case that 
width x — width y. Moreover, we have 

height\a\ = 1 

height(x « y) — height x + height y 

height(x <)> y) — height x • height y 

width \a\ =1 

width(x €► y) = widthx • width y 

width{x <(> y) = width x + width y 

where • is the operator described in Example 4 above. 

In displayed examples we shall use round brackets to indicate arrays. For 
example, the array 

/ 1 2 3 \ 

4 5 6 
\ 7 8 9 ) 

has height and width 3. It is the array described by the formula 
(|1| 4> |2| «t> |3|) « (|4| <t> |5| * |6|) e (|7| * |8| * |9|) 

as well as many others. 

By definition, a row vector is an array of unit height, and a column 
vector is an array of unit width. 

Note, finally, that we choose not to define the concept of an empty array. 

4.3 Map 

We shall use the same symbol * for mapping over arrays as for mapping over 
lists. Suppose / : a — ► 0. Then /« : ja| — » is the array homomorphism 
defined by the three equations: 
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f*\a\ = \fa\ 

f*{x&y) = (/*£)«(/*!/) 

/*(z*y) = (/*!)♦(/♦») 

Note that 

height(f * x) = height x 

vridih{f * x) — width x 

The * distributive law 

(/ • g)* = /*-<?* 

is valid for arrays as well as lists. 
4.4 Reduce 

Given two operators ffi, ® : a x a — » a, we can define a reduction operator 
(©, ®)/ for arrays by three equations: 

(©,®)/|a| = a 

= ((e,«)/x)e ((«,«)/») 

(©,»)/(!*!/) = ((e,®)/x)«((© 1 «)/y) 

For these equations to be consistent we require that (a, ©, ® ) forms a birtoid. 
Note that the operator ©, which replaces the 'above' operator comes first 
in the reduction, and the operator ®, which replaces the 'beside' operator 
<t>, comes second. 

For example, ( + ,+)/ sums the elements in an array of numbers, and 
(V, V)/ determines whether there exists a true entry in an array of booleans. 
Both + and V are commutative and associative operators, so the binoid 
condition is satisfied. 

Ab further examples of array reductions, we have 

height = (+,•)/ • Xi* 
width = (•,+)/ ■ Ki* 
area = (+,+)/• Xi* 

where • was defined in Example 4 above. We also have 

area x — height x X width x 
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The top-left element of an array is given by 



topleft = (<€,<)/ 



Similarly, we can define reductions that return each of the other corner 
elements. Examples 2 and 3 above show that the binoid condition is satisfied. 
The identity function on arrays is given by 



defines the operation of array transposition. A simple but important fact is 
that tr is its own inverse, that is, 



We shall prove this fact below. 

Finally, notice that when reducing over a row or column vector one of 
the operators in a reduction is redundant. Accordingly, we shall write ©/ 
for reduction over vectors. The notation is the same as for reducing over 
lists, but the ambiguity is not harmful since row and column vectors can be 
regarded as lists. 

4.5 Promotion 

The one-point and join (or promotion) rules for lists have counterparts in 
the theory of arrays. (There is no analogue of the empty rule because we 
have not defined the concept of an empty array.) These rules are: 

one- point rules 



id = (e,*)/ 




The companion function 



tr = (»,«)/ 



tr - tr — id 



f*-\-\ = V\-f 

(©,»)/ -H = id 
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transpose rules 

/*•(*,«)/ = (♦.»)/•/** 

(©,»)/■ (*,e)/ = (»,©)/ ■ (©,»')/* 

Notice in the last rule that the order of the operators in the outer reduction 
is reversed. These six rules can be proved by induction, using the definitions 
of + and / over arrays. 

To illustrate these rules, we calculate: 

tr ■ tr = definition of tr 

(♦,*)/. H* ■(♦,«)/■ H* 

= map transpose rule 

(♦,»)/ - (4>,e)/ ■ |-|** ■ 1-1+ 
= reduce transpose rule 

(e,*)/.(*,e)/*.|-|**-|-|* 
= * distributivity 

(©,♦)/. ((♦,«)/■ |-|*-|-|)* 

= map one-point rule 

(>,♦)/ ■((♦,*)/ -l-l-l-D* 

= reduce one-point rule 

(«,*)/■ l-l* 
= definition of i<i 
iVi 

4.6 Zip 

Let us return for the moment to lists. For each binary operator © : ax/J — * 7 
we defiae a partial operator Y® (pronounced 'zip with ©') informally by the 
equation 

[01 , 02, . .., On] Yffi [&1 . h, M = © ftl > A3 © h, ...,On © &«] 

Thus, a; Y® f is defined only in the case that #r = The type of Y© is 
given by 

Ye : [a] x [0] - [7] 
Among the many properties that Ye enjoys is the fact that Y® ' s associative 
if © is; similarly, for commutativity and idempotence. 
Formally, we can specify Y® by three equations: 

[]Y.[] = [] 

[a]Y®[fc] = W®b] 

-H- !f) Y® (« -H- f ) = (sY««)-H-(lfYe») 
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The third equation is asserted only under the conditions that #z = #u and 
#y = It is, of course, just the condition that Y® abides with -H-. 

The same operator can be defined on arrays (and, indeed, on many other 
data structures). We have 

M Ye 1*1 = 

(z«k) Y®(u«f) = ( x Yffi u ) «• (y Ye «0 

(x * y) Y® (u * «) = {zYe«)*(sY®«) 
The last two equations require that i is the same shape as u, and y is the 
same shape as v. So, ^ abides with Y®i a nd so does <J>. 

Some useful examples of this operator are as follows. First, the function 

r OU « = {e,Y*)/ ■ INI* 

converts am array into a column vector whose entries are row vectors, one 
for each row of the array. For example: 



rows 



Similarly, the function 

converts an array into a row vector, each entry being a column vector. 
The related functions 

listrows = 0,Y+)/ ' [Ml* 
listcols = (Y+,*)/ • [[•]]* 

each convert an array into a list of lists. The function listrows turns an array 
into a list of rows, each row being a list of entries from a row of the array. 
The function listcols turns the array into a list of columns. We have 

height — # ■ listrows 
width = # ■ listcob 

Other pairs of identities one can establish are the following: 

listcols = listrows • tr 
listrows = listcols ■ tr 




( 


' 1 


2 


3 ] 


\ 




4 


5 


6 




\ 


7 


8 


9 J 


/ 



(©, ®)/ = ®j ■ ®/+ • listrows 
(©,&)/ = ®/ ■ ©/* ■ listcols 
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4.7 Directed reductions 



There axe a number of different issues involved in setting up the idea of 
directed reductions on arrays. First of all, there is the question of how 
many operators should take part in a reduction: one or two. Second, there 
is the question of how many directions one might wish to identify as being 
useful. 

We shall adopt the following solutions to these questions. Each directed 
reduction will involve a single operator, and we focus on two legitimate 
directions: top-down (top reductions), and left-right (left reductions). We 
shall use the notation ©^r for a top-reduction, and ffi-|» for a left reduction. 
(Since top reductions work downwards, we use a downward arrow, and since 
left reductions work rightwards we use a rigiitward arrow.) 

The effect of a top reduction is illustrated by 



1 2 3 \ 
4 5 6 
7 8 9 j 



= ((1®4)®7 (2®5)®8 (3®6)®9) 




Thus ffi \ x produces a single row as its result. Similarly, 

/ (l©2)ffi 3 
= (4© 5) ©6 
\ (7® 8)® 9 i 

so ffi -|* x produces a single column. 

Two basic examples of the directed rednctjons are provided by the fol- 
lowing alternative definitions of rows and cols: 

rows - (<(>-|») ■ |-|* 
cols - (©i) • |-|* 

One consequence of the given forms of directed reductions is that if we 
compose 

(©i) ■ («+) 

in sequence, then the result is a singleton array. To extract this value, 
suppose we define the : \a\ — » a by 



the \a\ - a 
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Now we can state the fact that every general reduction can be expressed as 
the composition of two directed reductions: 

(©,«)/ - the - (©+) • (®4>) 
(ffi,®)/ = *e -(«+)-(©*) 

We have not yet given the formal definitions of these two new reduction 
operators. We give the equations for a top reduction: 

(©±)|«| = |a| 

(ffi±)(s ■© = |6©fl| where b = tAe(© ± x) 

(ffi±)(z * ff) = (®± *)♦(©*!/) 



4.8 Accumulations 



Reasonably enough, corresponding to the top and left reduction operators 
are the top and left accumulation operators. These are denoted, respectively, 
by ffi^ and ©-jf ■ Their effect is illustrated by the following examples: 
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Thus, if © : a X or 



Left accumulations on arrays are related to left accumulations on lists by 
the equation 

listrows ■ (©-|f ) = (ffi-^)* • listrows 

Similarly, 

listcoh • (©£) = (©1^)* • kstcols 
The formal definition of ©^ is given by the equations 

©*M = H 

© t e = (© ^ x) e |6 © a| where b — bottomleft(® £ x) 
®i (**!/) = iff) 
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The function bottomleft (which returns the bottom-left corner of an array) 
is defined by 

bottomleft = (>,<)/ 

In order to relate array accumulations to array reductions, in the way that 
list accumulations were related to list reductions in the first lecture, we need 
to consider the array analogues of writs and tails. This we do next. 



4.9 Tops and bottoms 

There are four reasonable ways of dissecting an array: we shall call them 
tops, bottoms, lefts and rigkts. Two of them are illustrated by the following 
examples: 






tops 



\ 

The essential points here are that lefts and rights each return a single row, 
while tops and bottoms each return a single column. 

We can give the definition of lefts and tops in terms of accumulations: 

lefts - (*-|) ■ cois 
tops — ■ rows 

These equations should be compared to the definition 

into* = (+#)• [■]* 

of Lecture 1 . 

As one might expect, there is a multitude of relationships between the vari- 
ous forms of array reductions, array accumulations, and the four dissection 
functions introduced above. In particular, we have the 
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orthogonal reduction rules 



(©±)* • lefts = lefts 
(©-}»)* " 'op* = *°P a 



•(©i) 
■(©+) 



— plus a further two equations obtained by replacing lefts by rights and 
tops by bottoms. 

We also have another group of rules: 

orthogonal accumulation rules 



— plus a further two rules obtained in the same way as before. 

Finally, we have the analogues of the accumulation lemma of Lecture 1. 

Accumulation lemma 



4.10 Horner's rule 

The alert reader might at this point be wondering if there is an analogue 
of Horner's rule that works for arrays. Since tails corresponds to bottoms 
(and also to rights), the expression to be simplified in the array version of 
Horner's rule is 



It turns out that the conditions we need are: (i) that ® distriubtes (back- 
wards) through ffl; and (ii) ffi abides with 0. The first conditionis the same 
as in the case of Horner's rule for lists, but the second condition is new. If 
these two conditions axe met, then 



where ® is defined (as in the version of Horner's rule for non-empty lists) 



• lefts - lefts 
(©-If)* • tops = tops 



-(©*) 
■(«+) 



■ tops = rows ■ 
(©-(*)* ■ lefts = cols ■ (ffi-||> ) 



(© ■ ©)/ • (®,0)/+ ■ bottoms 



(©,©)/ ■ {9,0)/* ■ bottoms = (©,©)/ • ®i 



by 



a®b = (a®b)sb 
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We shall illustrate Horner's rule rather than give a forma] proof. Consider 
the array 




The left-hand side of Horner's rule applied to x gives 

(( 1 ® 3 ® 5) 0 (2 ® 4 ® 6)) © ((3 ® 5) 0 (4 ® 6)) © (5 O 6) 

Using the fact that ffi abides with 0, we can write this in the form 

((1 ® 3 ® 5) © (3 ® 5) © 5) 0 ((2 ® 4 ® 6) © (4 ® 6) © 6) 

Now, since ® distributes over ©, we can write the above expression in the 
form 

((((1 ® 3) © 3) ® 5) © 5) 0 ((((2 ® 4) © 4) ® 6) © 6) 
Using the definition of ©, this simplifies to 

((1® 3)® 5)0 ((2® 4)® 6) 

and thus to 

(Q,0)/(®±x) 

where z is the original array. We shall see an application of this form of 
Horner's rule in the last section. 

4.11 Rectangles 

By definition, a rectangie of an array x is a contiguous subarray of x. Thus, 
rectangles are to arrays as segments are to lists. In this section we put 
the four dissection functions together in order to define the rectangles of an 
array. 

First of all, we give some arrays of arrays that will prove useful. The 
top-lefts (topis) of an array is defined by 

topis = • tops* • lefts 
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Similarly, we have the bottom-rights (botrs) defined by 
botrs — («,<(>)/ • bottoms* ■ rights 



The horizontal -segments (h3egs) are defined by 



hsegs — (e,<t>)/ • bottoms* ■ tops 



and the vertical-segments (vsegs) by 



vsegs = (e-, <(>)/ - rights* ■ lefts 



We can put these functions together to define the rectangles of an array: 



Thus, recto returns an array whose elements are the rectangles of a given 
array. We shall see in the next section that there are various equivalent ways 
to define rects; in particular, 



defines exactly the same function. 
4.12 BRTL rules 

It is an inevitable consequence of the extra dimensionality provided by arrays 
that the number of possible algebraic identities goes up by a multiplicative 
factor. Too many identities are almost as much a problem in program cal- 
culation as too few. In this section we describe, in effect, no fewer than ten 
additional equations. Fortunately, they come in logical groupings. 
We start with the fact that 



recto = (»,$)/ • botrs* ■ topis 



recta ~ (e,<t>)/ • hsegs* ■ vsegs 



(e, $)/ ■ tops* ■ lefts = (»,*)/ - lefts* ■ tops 
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There are three additional equations of this kind: we can replace tops by 
bottoms and lefts by rights. 

From these four equations we can generate four more. Two of them are: 

(e,<J>)/ • vsegs* • tops = • tops* ■ vsegs 

(e,<j>)/ • hsegs* ■ lefts = • lefts* ■ hseg3 

The two additional ones are obtained by replacing tops by bottoms and lefts 
by rights. 

Finally, to the above eight there can now be added: 

(©,*)/ • botrs* ■ topis = («,$)/ • vsegs* • hsegs 
(e,$)/ - vsegs* ■ hsegs = (&,$)/ ■ hsegs* • vsegs 

That gives ten. A good way to remember these identities, particularly the 
last two, is to see that the relative order of bottoms (B) and tops (T), and 
the relative order of rights (R) and lefts (L) remain unchanged throughout 
all the equations. For example, the last two rules can be captured with the 
abbreviations 

BR- TL = RL ■ BT 
RL ■ BT = BT ■ RL 

Similar abbreviations hold for the other rules. 

For a sample proof we shall tackle the fifth equation. For convenience, 
we introduce the additional abbreviations: 9 for (•&,*)/ and V for vsegs. 
The following chain of equations] reasoning is given without comments. 
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4.13 Rectangle decomposition 

Let us now turn to the problem of computing R, where 

R = (ffi,©)/ •/* • rects 

We want to show that if 

B - (©, ©)/ • /* • bottoms 
can be expressed in the form 

B = g ■ 

for suitable g and ®, then R can be expressed in the form 
R = (©,©)/ ■ ft* • rows - 

where k is defined by 

ft = (©,©)/ ■ g* ■ vsegs 

The advantage of this expression for R is that if ft (which is essentially a 
problem about the segments of a list) can be computed in linear time, and 
if values of (*> can be computed in constant time, then R can be computed 
in time proportional to the number of elements in the given array. 
We shall need the following observation about rects: 

rects = one definition of rects 

(e, 41)/ ■ vsegs* • hsegs 
— definition of hscgs 

(e,4))/ • vsegs* ■ (•&,<)>)/ • bottoms* - tops 
= map and reduce promotion 

(©,41)/ ■ ((e,4>)/ - vsegs* • bottoms)* ■ tops 
= BRTL rule 

(©,41)/ • ((©,40/ • bottoms* ■ vsegs)* ■ tops 
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Now we argue: 



R — observation 

(©,©)/ - /+ • (■©,*)/ • ((■&,*)/ • bottoms* ■ vsegs)* ■ tops 
= map and reduce promotion 

(©,©)/ • ((©,©)/ • /* ■ • bottoms* • vsegs)* ■ tops 

= promotion rules 

(©,©)/ - ((©,©)/ • ((©,©)/ ■ /* - bottoms)* ■ vsegs)* • tops 

- assumption on B 

(©,©)/ ■ ((©,©)/ • {g • ■ vsegs)* ■ tops 

= * distributivity 

(©,©)/ ■ ((©,©)/ ■ g* ■ ■ vsegs)* ■ tops 

= orthogonal reduction rule 

(©,©)/ • ((©,©)/ • g* ■ vsegs • (®i))* • tops 
= * distributivity; h = (©,©)/ • g* • vsegs 

{©,©)/ ■ h* - {®D* ■ tops 

— accumulation lemma 
(©,©)/ ■ h* ■ rows ■ (®$) 

4.14 Application 

At long last we are in a position to make progress on the problem posed at 
the beginning of this lecture. The problem is to compute iZ, where 

R = \j ■ area* ■ filled"! ■ bag • rects 

given that R is applied to arrays whose elements are 1 or 0. 
The function bag is defined by 

bag = -X-]* 

Since we have not defined the filter operation on arrays (and cannot reason- 
ably do so if the result is to be an array), we must first make a bag out of 
the array of rectangles in order to filter out those rectangles we do not want. 
The predicate filled caD be defined by 

JHI«d = (A,A)/.(=l)« 

The area of a rectangle can be defined by 

area= (+,+)/ ■ #i* 
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In fact, we can make a convenient simplification and eliminate the filter 
from the specification of R. Define a modified addition operator © by the 
equation 

a © i> = 0 ifa = 0Vb = 0 
= a + b otherwise 

This operator is both associative and commutative. If we define 

area' - (©, ©)/ 

then it is easy to show, provided x is an array over {0,1}, that 

area' x — arcax, if filled x 
= 0 otherwise 

Thus we can eliminate the filter, and reformulate the problem as one of 
computing R, where 

R = (t-T)/ • (©.©)/* • recta 

For an array filled with zeros, the new version of R returns zero (while the 
previous version returned — oo); otherwise the functions are the same. 

The new form of R is such that we can try to apply the rectangle de- 
composition theorem of the previous section. This means that we have to 
express 

B = (T,T)/ ■ (©,«)/* ■ bottoms 

in the form 

B = g ■ 

for suitable g and ®. 

A good place to start is to try and use Horner's rule. Fortunately, the 
first condition of Horner's rule, namely that © distributes through |, is easily 
seen to be satisfied. Unfortunately, the second condition, which is that ffi 
abides with f, is not satisfied. 

All is not lost, because we have not yet used the fact that we are only 
considering arrays over {0, 1}. The first crucial observation is that, provided 
x is an array over {0, 1}, we have 

(©,©)/x = vridthx x 

The restriction on x is necessary to ensure that the reduction (ffl,|)/a; is 
well-defined. The abide condition, namely that 

(a® b) l{c® d) = (a I c) ® (b I d) 
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is only valid when at least one of a, b, c, d is zero, or all are non-zero and 
equal. If this condition is true of the elements of x (i.e. all non-zero elements 
are equal in value), then the abide condition is satisfied. For such an x we 
therefore have 

B(x) = width ix(T, T)/(©> I)/ * bottoms x 

Furthermore — and this is the crux — under the same assumption about 
the elements of x, we have that j abides with |. This gives us Horner's rule. 
We can summarise the above observations in the following way. Define 

gx = width x X |/z 

and 

Equivalently, we can define ® by 

a®b - 0 if 6 = 0 
= a + b otherwise 

Then, provided x is an array over {0, 1}, we have 

(T. T)/(©> ©)/ * bottoms x = g(® ± x) 

A formal proof of this result can be given by induction and is left to the 
reader. 

Now, using rectangle decomposition, we obtain 
R — |/ - ft* • rows • (®^) 

where 

h — y ■ g* ■ vsegs 

We can reformulate k as a function on lists. Replace rows in the expression 
for R by listrows , and vsegs in the definition of ft by segs . We then have 

ft - 17 ' /* ' se 9 s 

where 

fx = #xxl/x 

The new definition of ft has a simple intuitive interpretation. Think of 
a sequence x of nonnegative numbers as representing a histogram. The 
function ft then computes the largest rectangular area under the histogram 
x . In the final lecture we shall show how to compute ft in linear time. Since 
® is computable in constant time, it then follows that R can be computed 
in time proportional to the number of entries in the array. 
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4.15 References 



The largest filled rectangle problem was invented as a generalisation of a 
similar problem about largest filled squares. This largest filled square prob- 
lem was posed and solved in 

[1] Gries, D. A note on a standard strategy for developing loop invariants 
and loops. Science of Computer Programming 2 (1982) 207-214 

After solving the largest filled rectangle problem, the author learnt of an- 
other solution (expressed as a five-pass algorithm) described in 

[2] Dean Brock, J. Finding the largest empty rectangle on a grated surface. 
Proc. 4 th Annual Symp. on Tieoretica/ Aspects of Computer Science, 
Passau, W. Germany — in LNCS, No. 247, Springer- Verlag, Berlin, 
(1987) 66-75. 

5 Trees 

(Note: The material of this section is the result of joint work with Jeremy 
Gibbons of the PR.G, Oxford, For reasons of space, most of the ideas are 
sketched only briefly. A fuller account will appear elsewhere.) 

5.0 Problem 

Define a heap to be a labelled binary tree ( with the property that for each 
subtree t' of t, the label of t' is a number which is no greater than the 
labels of all subtrees of t' . Required is an efficient algorithm for converting 
a sequence of numbers i into a heap whose inorder traversal is x. 

5.1 Trees 

The type of labelled binary trees (henceforth, just called trees) with labels 
from q wiU be denoted by (or). This type is specified by a free algebra 
generated from elements of a under the assignment (■):«*—• (a) which 
maps elements of a to singleton trees. The constituents of this algebra are: 

(1) Two partial operations I (pronounced 'under') and ^ (pronounced 
'over') such that / associates with V m t fle sense that 

(x I y) \ z = x I (y \ z) 

whenever both sides are defined. 
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(2) Two predicates noieft and noright such that x / y is defined just in 
the case that noieft y holds, and x \ y is defined just in the case that 
noright x holds. Moreover, we suppose that 

noieft {a) = True 

noleft(z I y) — False 
noleft{x V y) - noieft x 

noright {a) = True 
noright(x / y) — noright y 
noright(x \ y) = False 

Another way of putting these conditions is to say that noieft x holds just in 
the cases that x is a singleton tree, or x is of the form (a) \ y. Similarly, 
noright x holds just in the cases that x is a singleton tree, or x is of the form 
y I (a). It follows that the expression 

i Jy \z 

is well-defined just in the case that y is a singleton tree. 

Let us relate these operations on trees to the usual pictures. For example, 
the tree 

{{a)J(b)\(c})J(d)\(e} 
can be drawn in the following way: 




On the other hand, the tree 

W/(i)\(W/WlW) 

can be pictured as follows: 
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0 © 



We have not yet defined the notion of an empty tree. If we wish to Lave 
such a tree, then we can denote it by ( ) and suppose that ( ) is the unique 
left identity of J and the unique right identity of \. Thus 

()Jx = x 
x\() = X 

for all non-empty trees x. We also suppose that x J { ) and (} \ i are 
undefined, for otherwise 

and so J and \, no longer associate. 



5.2 An alternative view 

An alternative, and more common, view of labelled binary trees is in terms 
of a ternary constructor Bin. This view is captured in the type declaration 

tree a ::= Nil \ Bin (tree a) a (tree a) 

(The syntax used here is that of the functional language Miranda; it is also 
used in the text j3] cited in Lecture 1.) This declaration defines a tree to 
be either the empty tree Nil, or a tree Bin x ay with left subtree i, label a, 
and right subtree y. 

We can move from the ternary view to the binary view by using the 
equivalences 

Ail = () 

Binx ay = x J (a) \ y 



57 



A slightly different view of trees excludes the empty tree. This view recog- 
nises trees as being of one of the following forms: 

(«) 
x /(a) 

The advantage of viewing labelled binary trees in terms of two partial binary 
operators, rather than a single total ternary operator, is primarily that we 
can set up the notion of redaction on trees in a simple manner. However, 
each view of trees has its advantages and disadvantages, so we shall employ 
whichever is the more convenient in a given situation. 

5.3 Map and reduce 

The next move, which should he familiar by now, is to define the map and 
redaction operators for trees. The definition of * is given by three equations: 

/*(«) = (/«> 
f*{x\y) = (f*x)\(f*y) 

For reduction we need two operators ffi and 0 such that © associates with 
®. We then define 

(«,»)/(* /y) = ((©,»)/*) e ((©,«)/») 
(«,«)/(* \jr) = ((©,«)/*)»((©,«)/») 

These equations define reduction over non-empty trees. If, in addition, © 
has a unique left identity element e, and e is also the unique right identity 
element of ®, then we can set 

{«,«)/<) = e 
Let us now consider some examples of reduction. 
(1) The iabe/ of a tree is given by 

label = (>, <)/ 
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The operator ^> associates with for both sides of 

a > (b < c) = (a > b) < c 
simplify to 6. Note, however, that < does not associate with 3>, that is, 

a<(b> c)fi(a< 6)»c 
Here, the left-hand side reduces to a, but the right-hand side reduces to c. 

(2) The inorder traversal of a tree is defined by 

inorder = (4f,-H-)/ • [■]* 

(3) The size of a tree is defined by 

size^ {+,+)/ ■ K-i* 

(4) The function member a:, which determines whether x appears as a label 
in a given tree, is defined by 

member x = (V,V)/ • (= x)* 

In the examples above there is only one operator in the reduction, and this 
operator is associative. The reduction is well-defined because an associative 
operator associates with itself. Moreover, each operator possesses an identity 
element (which is therefore the unique left and right identity element of the 
operator), and so each of the above functions is defined on the empty tree. 

Here are some examples where the operators in a reduction are not the 
same. 

(5) The depth of a tree is defined by 

depth = (9,e)/ • Ki* 
where the operator © is defined by 

a®6 = (l + a)T& 

and, as usual, a© 6 = 6 © a. Neither © nor © is associative, but nevertheless 
© associates with ©. The proof of this fact is left to the reader. 
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(6) A similar function is keaporder, defiued by 

heaporder = (©,©)/ ■ [-]* 

where © is defined by 

x © ([a] -H- y) = [a] -ft- (x ft y) 

We leave to the reader the proof that © associates with ffi, and that [] is 
the unique left identity of © (and therefore the unique right identity of ©). 
Thus, we have 

heaporder ( ) = (] 

An alternative definition of heaporder can be based on the ternary view of 
trees: 

heaporder ( ) = [ ] 

heaporder ( x J (a) \ y) — [o] -H- (heaporder x /ft heaporder y) 

By definition, a tree a; is a heap if heaptreex holds, where 

heaptree = nondec ■ heaporder 

Here, nondec is a predicate that determines whether a sequence is in non- 
decreasing order. An alternative definition of heaptree is given by the equa- 
tions: 

heaptree(a) = True 

heaptree(x J y) = heaptree x A heaptree y A label x ^ label y 
heaptree(x \ y) — heaptreex A heaptree y A label y < label x 

5.4 Accumulations 

Let us now very briefly consider accumulations on trees. There are two kinds: 
upwards (or towards the root) and downwards (or towards the leaves). For 
reasons of space we consider only the first kind. 

An upwards accumulation (on non-empty trees) is denoted by A> an ^ 
takes a pair of operators (®,®) as its left argument, and a tree x as its 
right argument. The result is a tree of the same shape as x. The function 
up = (ffi, ®)X is defined by the equations: 

up{a) = {a) 

up(x I (a)) = up x J (b ©a) 
u p((a)\y) - {a®c)\upy 
up(xj{a)\y) = upz J (b®a®c)\upy 

where b = label(upz) and c = label(upy) 
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For example, tlie function subtrees with type 

subtrees : (ft) — * ((a)) 

can be defined by 

subtrees — (J, \) A '{')* 

The expression subtrees! evaluates to a tree of exactly the same shape as x 
but whose labels are the subtrees of x. In particular, 

label * subtrees x — x 

We shall use this result below. 

The accumulation lemma for trees states that 

(®i®)/* • subtrees 

The expression on the left can be computed in time linear in the size of the 
tree, whereas the expression on the right takes quadratic time in the worst 
case. 

For example, the function 

treesizes = size* • subtrees 

can be re-expressed as 

treesizes = (+, +) A ' K\* 

and therefore can be computed in linear time. (We leave the reader to 
formulate and prove the necessary subsidiary identity that enable the second 
definition of treesizes to be calculated from the first.) 

5.5 Building a heap 

The problem stated at the beginning was to build a heap whose inorder 
traversal was a given sequence. We are required, therefore, to construct a 
function heap satisfying the equations 

inorder {heap x ) = x 
heaptree(heap x) = True 

for all lists x. In words, the first equation says that heap is a right-inverse 
of inorder, while the second says that heap must return a tree satisfying 
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the heap condition. Note that inorder is surjective, but not injective. For 
example, both the trees 

(1>\«1>Y<2» and <l)y(i)\<2> 

produce the inorder traversal [1,1,2]. Moreover, both trees are heaps, so 
the specification of heap does not determine the function uniquely. Indeter- 
minacy arises because there is a choice as to the relative placement of equal 
values. 

To determine a constructive definition of heap, we can look for a solution 
of the form 

heap = ©/-/* 

In other words, we can look for a definition as a homomorphism over lists. 
Using the specification of heap, it is possible to calculate the values of / and 
ffi. We obtain that / = (■) and that © is determined by the equation 

(xJ{a)\y)®( U Ui>)\v) = xl(a)\(y®(ul(b)\v)) if a < b 

= ((x y (a) \, y) © u) I {&) \ v otherwise 

together with the condition that { ) = id^. We leave to the reader the proof 
that this definition of heap meets its specification. 

In this solution, indeterminacy is resolved by placing equal values to 
the left. More precisely, suppose we define the righ t-spine of a tree by the 
equation 

rspine = (>,4f)/ - [■]* 

Informally, the right-spine of a tree is the sequence of labels obtained by 
starting at the root and proceeding along the right branches to the rightmost 
tip. Then our definition of heap is such that rspine(heap x ) is a sequence in 
strictly increasing order. 

5.6 Solution as a left reduction 

The definition of heap as a homomorphism does not prescribe an order 
of computation. To obtain a sequential algorithm we can specialise the 
definition to a left reduction: 

heap ' j 
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where z ® 6 = i © / b. Simplification yields: 

O&b = (b) 

(z i {«) \ V) ® b = i/(fl)H!f®6) if a < 6 
= ((i I (a) \ y) I (6) otherwise 

A pictorial interpretation of ® is as follows. If i is the heap 




then z ® b will be the heap 
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where j is defined by the condition Oj < b ^ Oj +1 . 

The running time of this algorithm is 0(N' i ), where JV is the length of 
the argument. Each element b may be compared with every label in the right 
spine, and the right spine can increase by one in length at each step. A more 
efficient algorithm can be obtained by comparing b with labels in the right 
spine, starting with the rightmost label and proceeding to the root. The 
amount of processing done at each step is then proportional to the change 
in length of the right spine. This gives a linear time algorithm for building 
the heap. To implement the idea we need a change in representation. 

Consider the function cut defined informally by the equation 

cut(xi \ (X2 V (. . ■ \ z„))) = [zi,z 2 ,---,z n ] 

where each Xj is such that rtorightxj holds. Thus, cut takes an arbitrary 
tree and returns a sequence of trees obtained by removing every right branch 
along the right spine. It is easy to check that cut is a bijective function. If 
we define 

paste = \fy 

(a definition involving a truly horrendous juxtaposition of arrows) then cut 
and paste are inverse functions, that is, 

cut - paste = «fj w j 
paste ■ cut = tef( 0 } 

To implement the change in representation, we can modify the definition of 
heap by writing 

heap = paste - j 
where O is specified by the equation 

(cut x) Q b = cut(x ® b) 
Putting it another way, if 0 and ® are related by the above equation, then 

cut 0/ cut ( ) 

and so, by applying paste to both sides, we get the new equation for heap. 

To complete the change in representation, it remains to synthesise a 
constructive definition of 0 from its specification. Omitting details, we can 
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calculate that 

[]©& = [(b)] 

([x I {a)] -tf xs)e b - \x i {a)] -+f {xs © 6) if o < 6 

= [paste([x I {a)] -+f xs) / (ft)] otherwise 

This effects the change in representation but, in order to achieve tie desired 
increase in efficiency, we still need to change the order in which the elements 
of the left-hand argument of © are processed. 

5.7 Prefix and suffix 

Let us introduce four new operators on lists. They are J ('take prefix'), 1 
('drop prefix'), L ('take suffix') and r ('drop suffix'). Each operator takes 
a predicate on the left and a list on the right. The definitions of p J x and 
pLi are: 

pJx = ]y.jall p o inits x 

pi- x = T#/ a " P ° z 
Both operations can be implemented efficiently so that the number of cal- 
culations of p equals one more than the number of elements in the result. 
The remaining two operators are defined by the equations: 

(p Jz)-¥r{p-\x) = X 
(?ri)f (pLi) = x 

Thus p H x is what remains when p J x is removed from x. A similar 
statement holds for p r x. 

We state without proof the following lemma. 

Lemma 4 Let x be a sequence and p a predicate such that pti is non- 
increasing (taking False < True). Then 

p J i = p r x 

wiere p is the negation of p. 

5.8 A linear algorithm 

We can now use the newly introduced operators in the construction of a 
linear time algorithm for our problem about building a heap. Recall that, 
currently, we have 

heap = paste ■ ©/r i 
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where 

paste = \t() 

and 

[]©* = {{!>)} 

([x / (a)] -H- xs) © b = [i/(a)]*(u0J) if a < 6 

= [pa5te([i y (a)] -H- xs) / (6)] otherwise 

Using the operators J and "1, we can rewrite the definition of 0 in the form 

xs 0 b = ( Pi J xs) -tf [j?o*Mj?(, "l xs) I {b)\ 

where the predicate pt is defined by 

p b (x / (a)) = (a < b) 

Since, by the heap assumption, pt, * xs is non-increasing, we have that 

xs © b = (pj, r xs) 4f [paste(pi, L is) / (6)] 

With the new definition of O, the function heap can be computed in linear 
time. 

5.9 Application 

The representation of sequences by heaps has a number of uses, of which we 
give just one brief illustration. The problem that arose in the last lecture, 
namely to compute the area of the largest rectangle under a, histogram, can 
be formulated as a function 

mra = f/ ■ area* ■ segs 

where 

areax — #z x \jx 

We claim without proof that the function mra can be written in the form 
mra — (TiT)/ ■ area'* ■ subtrees ■ heap 

where 

area' x = size x X label x 
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Now, suppose we define 

areas = area'* ■ subtrees 
Using the results about subtrees cited above, we can then calculate 
areas x = size * subtrees x Yx label * subtrees x 

= ((+,+) A Xi**)Yx* 

Thus areas can be computed in linear time. It follows that 

mra = (J, \)j ■ areas ■ heap 
can also be computed in linear time. 

5.10 References 

The idea of using heaps to solve certain problems about segments can be 
found in 

[1] De Moor, 0. and Swierstra, D. The low segment problem. Presenta- 
tion at WG2.1, Rome, March 1988. 

The largest rectangle under a histogram is a generalisation of Problem 40 in 

[2] Rem, M. Small programming exercises. Science of Computer Pro 
gramming. 1987. 

Rem's problem is to compute the size of the largest square under a his- 
togram; in symbols, 

T/ ■ #* • P< ■ segs, 

where px — (l/z J? 
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